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Abstract 



Most modern formalisms used in Databases and Artificial Intelligence for describing an 
application domain are based on the notions of class (or concept) and relationship among 
classes. One interesting feature of such formalisms is the possibility of defining a class, 
i.e., providing a set of properties that precisely characterize the instances of the class. 
Many recent articles point out that there are several ways of assigning a meaning to a 
class definition containing some sort of recursion. In this paper, we argue that, instead of 
choosing a single style of semantics, we achieve better results by adopting a formalism that 
allows for different semantics to coexist. We demonstrate the feasibility of our argument, by 
presenting a knowledge representation formalism, the description logic iiACCQ, with the 
above characteristics. In addition to the constructs for conjunction, disjunction, negation, 
quantifiers, and qualified number restrictions, fiAjCCQ includes special fixpoint constructs 
to express (suitably interpreted) recursive definitions. These constructs enable the usual 
frame-based descriptions to be combined with definitions of recursive data structures such 
as directed acyclic graphs, lists, streams, etc. We establish several properties of nACCQ, 
including the decidability and the computational complexity of reasoning, by formulating 
a correspondence with a particular modal logic of programs called the modal mu-calculus. 

1. Introduction 

Most modern formalisms used in Databases and Artificial Intelligence for representing an 
application domain are based on the notions of class (or concept) and relationship among 
classes. For example, the object-oriented and semantics data models developed in Databases 
describe data in terms of classes (sometimes called entity types) and incorporate several 
features for establishing various forms of relationships between classes. On the other hand, 
the notion of class (often called concept or frame) and that of link among classes are provided 
in all structured formalisms for Knowledge Representation (frame-based languages, semantic 
networks, description logics, etc.). Finally, this notion is also present in several type systems 
of programming languages, specially those based on the object-oriented paradigm. 

There are basically two ways of using and describing classes (concepts). In the first one, 
which we can call the prescriptive approach, the description formalism allows for expressing 
a number of properties of a class, thus prescribing constraints that the instances of the class 
must satisfy. In the second one, which we can call the definitional approach, the formalism 
allows for providing the definition of a class, i.e., a set of properties that precisely character- 
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ize the instances of the class. While the prescriptive approach is quite well understood and 
established, the definitional approach is still the subject of an interesting debate, regarding 
both its nature and its semantic foundation. In particular, it is well known that there are 
various ways to assign a meaning to a class definition when it contains some sort of recursion 
(Baader, 1990, 1991; Nebel, 1991; Beneventano h Bergamaschi, 1992; Beeri, 1990). 

In this paper, we are concerned with the semantic problems related to the definitional 
approach, arguing that, instead of choosing a single style of semantics for the knowledge 
representation formalism, we achieve better results by allowing different semantics to coex- 
ist. 

We discuss this issue in the context of Description Logics 1 , which are logics originally 
developed in Knowledge Representation to provide a formal reconstruction of frame-based 
languages. Description logics describe the domain of interest in terms of concepts, which 
represent classes of individuals, and roles, which are binary relations used to specify proper- 
ties or attributes of individuals as well as links with other individuals (Nebel, 1990) . Starting 
from atomic concepts, denoted simply by a name, more complex concepts are built by us- 
ing a suitable set of constructs. For example, the expression parent n male n \/child.male 
denotes the concept of father (male parent) whose children are all male. The symbol n 
denotes the construct for concept conjunction, while V denotes universal role quantification. 
Typically, concepts are structured into hierarchies determined by the properties associated 
with them. The hierarchical structure is defined in such a way that more specific concepts 
inherit the properties of the more general ones. 

We introduce a description logic, called uACCQ, which extends the well-known de- 
scription logic ACC (Schmidt-Schaufi & Smolka, 1991) by including the so called qualified 
number restrictions, which are a very general form of cardinality constraints on roles, and 
special fixpoint constructs, which enable us to capture the various semantics for recursive 
definitions within a single formalism. Notably, the availability of these constructs makes 
it possible to combine the usual frame-based descriptions with definitions of recursive data 
structures such as directed acyclic graphs, lists, streams, etc. 

We establish several properties of uACCQ, including the decidability and the compu- 
tational complexity of reasoning, by formulating a correspondence with a particular modal 
logic of programs called the modal mu-calculus. 

Recent articles, (e.g., Bergamaschi & Sartori, 1992; Borgida, 1992), advocate the use 
of description logics as a unifying framework for several types of database and knowledge 
representation formalisms. Indeed, it is possible to show that, depending on both the con- 
structs and the semantics used, one can capture several database models and programming 
language type systems by using description logics. Therefore, the study presented in this pa- 
per is not merely confined to description logics, but is also applicable to other representation 
formalisms. 

The paper is organized as follows. In Section 2, we present the basic notions regarding 
both description logics and fixpoints. In Section 3, we motivate our approach through 
a detailed discussion about the different semantics of concept definitions that have been 
considered in the literature, and we argue for a formalism in which the various semantics 
coexist. In Section 4, we present one such formalism, namely the logic uACCQ, and we 

1. Also called Concept Languages or Terminological Languages. 
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discuss several of its properties. In Section 5 we study reasoning techniques for \iACCQ 
and expose the correspondence with modal mu-calculus. Finally, in Section 6, we draw the 
conclusions and discuss some open problems. 

2. Preliminaries 

In this section, we briefly present the basic notions regarding both description logics, and 
fixpoints. The interested reader is referred to (Nebel, 1990) and (de Bakker, 1980) for a 
more complete introduction to the subjects. 

2.1 Description Logics 

Description logics allow one to represent a domain of interest in terms of concepts and 
roles. Concepts model classes of individuals, while roles model relationships between classes. 
Starting with atomic concepts (denoted by A) and atomic roles (denoted by R), which are 
concepts and roles described simply by a name, complex concepts and roles can be built by 
means of suitable constructs. 

In this section, we concentrate on the description logic ACCQ, obtained from the well- 
known description logic ACC (Schmidt-Schaufi & Smolka, 1991) by including qualified num- 
ber restrictions. These are cardinality constraints on the role fillers of a very general form, 
where role fillers to which a constraint applies are selected by means of a generic concept 
expression, the qualifier. 

ACCQ concepts (denoted by C or D, possibly with a subscript) are composed inductively 
according to the following abstract syntax (n denotes a natural number): 

C ::= A | T | _L | | d n C 2 | d U C 2 | 3R.C | Vi?.C | (< nR.C) | (> nR.C). 

These constructs are not all independent. The following equalities hold: T = A U ->A, 
_L = -.T, VR.C = ~^3R.^C, and (< n R.C) = -.(> n + 1 R.C). 

From a semantic point of view, concepts are interpreted as subsets of an abstract domain, 
while roles are interpreted as binary relations over such a domain. More precisely, an 
interpretation X = (A 1 , x ) consists of a domain of interpretation A 1 , and an interpretation 
function x mapping every atomic concept A to a subset of A x and every atomic role R to 
a subset of A x x A 1 . 

The interpretation function x is extended to complex concepts of ACCQ (note that in 
ACCQ roles are always atomic) as follows: 
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where #5 denotes the cardinality of the set S. 

A concept C is satisfiable iff there exists an interpretation 1 such that C x ^ 0, otherwise 
C is unsatisfiable. A concept C\ is subsumed by a concept C2, written as C\ C Ci, iff for 
every interpretation I, Cf C Cj. 

Our knowledge expressed in terms of concepts and roles is assembled into a special 
knowledge base, traditionally called TBox, which consists of a finite (possibly empty) set 
of assertions. In order to be as general as possible, we assume that every assertion has the 
form of an inclusion assertion (or simply inclusion): 

Ci E C 2 

without any restriction on the form of the concepts C\ and Ci- A pair of inclusions of the 
form {Ci C C2, C2 Q C{\ is often written as C\ = C2 and is called equivalence assertion. 

An interpretation I satisfies an inclusion C\ C C2 iff Cf C Cj. An interpretation I is 
a model of a TBox K, iff X satisfies all inclusions in K. 

Let /C be a TBox. We say that a concept C is satisfiable in IC, iff there exists a model 
I of K, such that C x 7^ 0, unsatisfiable otherwise. We say that a concept C\ is subsumed by 
a concept C2 in /C, written K, \= C\ C C2, iff for every model X of /C, Cf C C^. 

2.2 Fixpoints 

We briefly recall few notions on fixpoints. Consider the equation: 

X = f(X) 

where / is an operator from 2 s to 2 s (2 s denotes the set of all subsets of a set S). Every 
solution £ of this equation is called a fixpoint of the operator / (while every set £ such that 
f(£) C £ is called pre-fixpoint, and every set £ such that £ C /(£) is called post- fixpoint). 
In general, an equation as the one above may have either no solution, a finite number of 
solutions, or an infinite number of them. Among the various solutions, the smallest and the 
greatest solutions (with respect to set-inclusion) have a prominent position, if they exist. A 
fundamental result due to Tarski (Tarski, 1955) guarantees the existence and the uniqueness 
of both such solutions in case / is monotonic wrt set-inclusion (C), where / is monotonic 
wrt C whenever £\ C £ 2 implies f{£\) C /(^)- 

Theorem 1 (Tarski) Let S be a set, and f an operator from 2 s to 2 s that is monotonic 
wrt C. Then: 

• There exists a unique least fixpoint of f , which is given by f]{£ C S \ f(£) C £}. 

• There exists a unique greatest fixpoint of f, which is given by \J{£ C S | £ C f(£)}. 

3. Concept Definitions as Equations 

We now analyze the notion of concept definition in detail. Let us ignore for the moment 
knowledge bases as they have been introduced in the previous section, and let us consider a 
different kind of assertion: the definition statement. Let the form of a definition statement 
(or simply definition) be: 

A =def C 
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where A is an atomic concept which cannot appear in the left-hand side of other definition 
statements, and C is a concept expression in ACCQ. In principle, A =def C is intended to 
provide an exact account for the concept to A in terms of C, i.e., to define A as the set of 
the individuals satisfying C. 

In specifying the semantics of definitions, we need to distinguish between two different 
types of atomic concepts, namely, primitive concepts and defined concepts. Given a set of 
definition statements, the primitive concepts are the atomic concepts that do not appear in 
the left-hand side of any definition statement, whereas the defined concepts are those that 
appear in the left-hand side of a definition statement. 

Given an interpretation X = (A 1 ,- 1 ), the interpretation function x directly assigns 
a subset of A x to primitive concepts, but not to defined concepts. The meaning of a 
defined concept A is assigned through its definition statement A =def C, extending the 
interpretation function so as the following requirement is satisfied: 

A 1 = C x . (1) 

Consider, for example, the definition statement: 

parent =def 3child.T. 

Note that the defined concept parent does not appear in the body of its definition statement. 
By (1), the definition statement provides the definition for the concept parent, in the 
following sense: in any interpretation 1 = (A 1 , parent 1 denotes a single subset of A x , 
exactly the one denoted by (3child.T) x , i.e., {s \ 3t.(s,t) 6 child 1 }. In general, if a concept 
A is defined in terms of primitive and already defined concepts, then for every interpretation 
I there exists a unique way to extend the interpretation function to defined concepts, and 
hence there is no doubt that the definition statement provides a definition of A. 
Now, consider the following definition statement: 

A =def 3child.A. 

Given an interpretation 1 = (A 1 , x ), by (1) the statement is interpreted as the equation: 

A x = {s 6 A x | 3t.(s,t) € child 1 and t £ A 1 }. 

However such equation does not specify univocally how to extend the interpretation function 
■ x to the defined concept A, since satisfies the equation as well as any set of individuals 
where each member has an infinite chain of descendants that are also members. 

In general, we call recursive definition statements 2 (or simply recursive definitions), 
definition statements of the form: 

A = def F(A) 

where F(A) stands for a concept that has A as a subconcept 3 . According to (1), the recursive 
definition A =def F{A) is interpreted simply as a sort of equation specifying that, given an 

2. Terminological cycles in (Baader, 1990, 1991; Nebel, 1991). In the present discussion, for sake of sim- 
plicity, we do not consider mutual recursive definitions, as A =def F(B), B =def F'(A). We shall come 
back to this point later on. 

3. A subconcept of a concept C is any substring of C (including C itself) that is a concept, according to 
the syntax rules. 
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interpretation X, the subset of A x that can be tied to the defined concept A must satisfy 
the equation A x = F(A) X , i.e., must be one of its solutions. Observe that, in general, either 
none, one, or several subsets of A 1 may exist which are solutions of the above equation. 

Another convenient way to consider a definition statement is to associate to it, for every 
interpretation X, an operator from subsets of A 1 to subsets of A 1 instead of an equation, so 
that the fixpoints of the operator correspond to the solutions of the equation. For example, 
to the definition A =def 3child.A we associate the operator: 

XS.{s € A 1 | 3t.(s,t) € child 1 and t E S} 

for any interpretation I. In general as either none, one or multiple solutions exist for the 
equation associated with a recursive definition, we have that either none, one or multiple 
fixpoints exist for the corresponding operator. 

In this situation the word "definition" itself seems misleading: the body of the definition 
does not give a complete account of the defined concept. An additional criterion is needed 
for selecting solutions of the associated equation, or equivalently, fixpoints of the associated 
operator. In other words in addition to (1), a criterion is needed to extend univocally the 
interpretations I to the defined concepts. This observation has led to various semantics, 
each of which interprets recursive definitions differently, by choosing, a priori and once and 
for all, which solutions, or equivalently which fixpoints, are to be assigned to the defining 
concept of a recursive definition 4 . 

3.1 Different Semantics for Recursive Definitions 

In the literature on description logics, three semantics for recursive definitions have been 
proposed (see Nebel, 1991): 

• Descriptive Semantics 

• Least Fixpoint Semantics 

• Greatest Fixpoint Semantics 

and which of these semantics is the "right" one is a long standing matter of debate. Below 
we describe how each of the three semantics interprets recursive definitions, and present 
some examples showing that the choice of the semantics depends in fact upon the concept 
to be defined. But first, it should be stressed that only the descriptive semantics is able to 
assign meaning to general inclusion assertions C\ C C2 introduced in the previous section. 

According to the Descriptive Semantics, a recursive definition A =def F(A) is a con- 
straint stating that, for any X, A 1 has to be a solution of the equation A x = F(A) X . Under 
the descriptive semantics, A =def 3child.A simply states that the individuals in the class 
A have a child in the class A, and the individuals that have a child in the class A are 
themselves in the class A, where A is no better specified. In general the descriptive seman- 
tics is not appropriate to properly define recursive concepts, in the sense that, given an 

4. We remark that a non-recursive definition is interpreted by the various semantics in the same way, since, 
for every I, the equation associated to it has a single solution. 
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interpretation X, it is unable to assign a unique subset of A x to the denned concept of the 
recursive definition. 

In fact under descriptive semantics definition statements are indistinguishable from the 
equivalence assertions introduced in the previous section. In other words, the meaning 
assigned to A =def F(A) is the same as that assigned to the equivalence assertion A = F(A). 
Although such equivalence assertions can be used to specify if-and-only-if constraints, they 
do not provide proper definitions when recursion is involved. For example, we can express 
the fact that humans are mammals having parents that are humans, and on the converse, 
that mammals having parents that are humans are humans themselves, in terms of the 
equivalence assertion: 

human = mammal n 3parent.T n V 'parent. human. 

Similarly we may require horses to satisfy an analogous property: 

horse = mammal n 3parent.T n Vparent. horse. 

However the two equivalence assertions above do not define human and horse as shown, 
e.g., by the fact that (correctly) they do not imply that all humans are horses and vice- versa 
(in contrast to what happen when a fixpoint semantics is used, see below). 

The Least Fixpoint Semantics interprets a recursive definition statement A =def F(A) 
by assigning to A the smallest possible extension in each interpretation X, among those that 
satisfy A x =def F(A) X - i.e., the least fixpoint of the corresponding operator. In fact it is 
always assumed that the operator associated with the definition statement is monotonic, 
so that Theorem 1 applies and a least fixpoint exists and is unique, i.e., the corresponding 
equation has a unique smallest solution. Hence under the least fixpoint semantics the 
recursive definition statement A =def F(A) defines the concept A. It is easy to verify that 
in the example A =def 3child.A, the least fixpoint semantics leads us to identify A with _L. 
Indeed the empty set is a solution of the equation associated with the statement, and it is 
obviously the smallest solution. Similarly if we interpret the definition statement: 

human =def mammal n 3parent.T n Vparent. human 

with the least fixpoint semantics, we have that human 1 = for any interpretation 1. 
Observe that if, as above, we adopt a similar definition for horse, we get again horse 1 = 0, 
so we can trivially infer that horse = human. 

The least fixpoint semantics is particularly suitable for providing inductive definitions 
of concepts. For example, consider the class of a list (LIST) defined as follows: 

• An EMPTY-LIST is a LIST. 

• A NODE that has exactly one successor that is a LIST is a LIST. 

• Nothing else is a LIST. 

The first two conditions can be captured by the following recursive definition statement 5 : 

list =def emptylist U (node l~l (< 1 succ.T) l~l Bsucc.list) 
5. Additionally we specify that the two concepts emptylist and node are disjoint. 
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where (< 1 succ.T) forces succ to be a function. To enforce the third condition we must 
assign the smallest possible extension to list. Thus, the class of LISTs can be naturally 
defined by means of the above definition statement, interpreted according to the least 
fixpoint semantics. 

The Greatest Fixpoint Semantics interprets a recursive definition statement A =def F(A) 
by assigning to A the largest possible extension in each interpretation 1, among those that 
satisfy A x =def F{A) X - i.e., the greatest fixpoint of the corresponding operator. Again, it 
is assumed that such operator is monotonic in order to guaranty the existence an the unicity 
of the greatest fixpoint (Theorem 1). As for the least fixpoint semantics, under the greatest 
fixpoint semantics a recursive definition statement A =def F(A) defines the concept A. For 
example, considering again the definition statement A =def ^child.A, the greatest fixpoint 
semantics leads us to interpret A as the class of all the individuals having a child that in 
turn is a member of A. 

While the least fixpoint semantics naturally captures classes defined by induction, the 
greatest fixpoint semantics naturally captures classes of individuals whose structure is non- 
well-founded or co-inductive. An example is the class of STREAMS, modeling the well- 
known linear data structure having a NODE as first element, and such that the rest of the 
structure is a STREAM itself. Note that streams are similar to lists except that while lists 
can be considered as finite sequences of nodes, streams are infinite sequences of nodes. Such 
a class can be captured by the following recursive definition statement: 

stream =def node n (< 1 succ.T) n 3succ. stream 

with the proviso that the greatest possible extension is assigned to stream. 

Finally, consider under the greatest fixpoint semantics the recursive definition state- 
ments: 

human =def mammal l~l Bparent.T l~l V 'parent. human 

horse =def mammal l~l Bparent.T l~l Vparent. horse. 

Although they do not assign the empty extension to both human and horse as the least 
fixpoint semantics does, we do have again the rather counter intuitive consequence that 
human = horse, since human 1 = horse 1 for any interpretation I. In general under both 
types of fixpoint semantics the particular name used to denote a defined concept does not 
have any impact on the interpretation of it, since the meaning of the defined concept is 
completely specified by its definition statement. 

3.2 Least and Greatest Fixpoints as Concept Constructs 

The above considerations show that arguing about which is the "right" semantics for recur- 
sive definitions is not really an issue. Each of them captures an essential use of recursive 
equations: the descriptive semantics is appropriate to specify constraints on concepts and 
is the only one that extends to the general inclusion assertions introduced in Section 2; the 
least fixpoint semantics is appropriate to define a structure inductively; the greatest fixpoint 
semantics is the appropriate to define non-well-founded structures. Generally, we may need 
the three of them in the same knowledge base, in order to model the various properties of 
the different concepts. 
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Our proposal in this paper is exactly in the direction of reconciling the various semantics 
in the same knowledge base. This is pursued by means of a logic that incorporates two 
constructs, nX.F(X) and vX.F(X) (the symbols X, Y, ... stand for concept variables), 
denoting, respectively, the least fixpoint and the greatest fixpoint of the operator associated 
with the definition X =def F(X), that is, for every X, the smallest solution and the greatest 
solution of the equation X x = F(X) X . 

In our approach, definition statements will never appear in a TBox. Instead, a knowledge 
base will be simply a set of inclusion assertions (interpreted according to the descriptive se- 
mantics) that can involve fixpoint constructs. For example, in order to specify the properties 
of the concepts of list, stream, human and horse we can use the following assertions 6 : 

list = uX. emptylist U (node n (< 1 succ.T) n 3succ.X) 

stream = vX.node l~l (< 1 succ.T) l~l Bsucc.X 

human = mammal l~l Bparent.T l~l Vparent. human 

horse = mammal n 3parent.T n V 'parent. horse. 

Note that, if we then add to the above knowledge base the equivalence assertion: 

mgm = vX . mammal V\ 3parent.~T n \lparent.X 

defining the concept mgm (mammal generated by mammal), then it correctly turns out 
that both human and horse are subsumed by mgm. 

The availability of least and greatest fixpoint constructs, by allowing different semantics 
to be used in the same TBox, makes it possible to model not only abstract classes, but also 
inductively and co- inductively defined data structures, such as lists and streams. This is 
particularly important if our objective is to integrate class-based representation formalisms 
and programming systems (declarative or procedural), in order to make these formalisms 
more useful in practice. Furthermore, we have the possibility of nesting flxpoints, thus 
going beyond the simple equational format by which we motivated their introduction. As 
an example, consider the following one: 

Among the inhabitants of the planet "Plonk" , a disease called "foo" is quite 
common. Such a disease manifests itself in two forms: a "visible" one and a 
"latent" one, and it has a rather intricate hereditary pattern. Individuals that 
have the visible form transmit the visible form to at least one (say the first) direct 
descendant (obviously, if there is any direct descendant), these ill descendants 
in turn do the same, and so on, until someone transmits the latent form of the 
disease. More precisely, along any chain of descendants, the visible form of the 
disease sooner or later is interrupted, because either an individual has no direct 
descendant or an individual transmits to some descendant the latent form. All 
direct descendants (if any) of an individual that has the latent form inherit the 
visible form. The pattern goes on like this, generation after generation, forever. 

The hereditary pattern (fooJhp) of the above disease can be defined as follows: 

fooJhp = vX.nY. ((visible n (3child.Y U Vc/ii/d. _L))U 
(latent F\ \/child.(visible F\ X))) 

6. We also include the assertion emptylist C -^node, specifying that the concepts emptylist and node are 
disjoint. 
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where visible and latent denote the visible and the latent form respectively of the disease, 
and are assumed to be disjoint (latent C -^visible). 

4. The Description Logic fxACCQ 

We provide a formal account of the meaning of the fixpoint constructs by introducing a 
description logic, called pACCQ, which is obtained by adding these constructs to ACCQ. 

We make use of the notions of scope, bound and free occurrences of variables, closed 
formulae, etc. The definitions of these notions are the same as the analogues in first-order 
logic, treating p and v as quantifiers. In addition, we use the symbol a as an abstraction 
for either \x or v. 

The primitive symbols in pACCQ are atomic concepts, (concept) variables (denoted by 
X, Y", . . .), and atomic roles which are the only roles admitted in the logic. 

Concepts in pACCQ are formed inductively according to the following abstract syntax: 

C ::= A | T | J_ | -.C | C\ n C 2 \ C\ U C 2 \ 3R.C \ VR.C | (< nR.C) | (> nR.C) \ 
pX.C I vX.C | X 

where A denotes an atomic concept, R an atomic role, n a natural number, and X a variable, 
and with the restriction that only a variable X occurring positively in C can be bounded by 
a fixpoint a in oX.C . We say that a variable X occurs positively in a concept C, if every 
free occurrence of X is in the scope of an even number of negations, considering concepts 
C in (< nR.C) in the scope of one negation. 

The two fixpoint constructs are mutually definable: vX.C = ~^nX.^C[X/^X] (where 
C[X/-<X] is the concept obtained substituting all free occurrences of X by the concept 
•V). 

As before, an interpretation 1 = (A 1 , - x ) consists of a domain of interpretation A 1 , and 
a interpretation function which maps every atomic concept to a subset of A 1 , and every 
atomic role to a subset of A x x A x . But the presence of free variables does not allow us to 
extend the interpretation function - 1 directly to every concept of the logic. For this reason 
we introduce valuations. A valuation p on an interpretation I is a mapping from variables 
to subsets of A x . 

Given a valuation p, we denote by p[X/£] the valuation identical to p except for 
p[X/S](X) = £. In other words, for every variable Y: 



= { my) ^-7x 

Let 1 be an interpretation and p a valuation on 1. We assign meaning to concepts of 
the logic by associating to I and p an extension function - x pl mapping concepts to subsets 
of A 1 , as follows: 
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Xl = P (X)CA I 

A* = A I CA I 

= A 1 

^ = 

K7)J = A x -Cf 

(dnc 2 )* = (Ci)*n(c 2 )* 

(CiUC 2 )^ = (Ci)^U(C 2 )J 

(3i2.C)J = {s 6 A 1 3s'. (a, a') G i? 1 and a' G C*} 

{VR.C)j = {a G A 1 | Va'. (a, a') G i? J implies a' € Cj} 

(< ni?.C)J = {a € A 1 | #{a' | (a, a') G R x and a' G Cf} < n} 

(> n i?.C)J = {a G A x | #{s' | (a, a') G R x and a' G C*} > n} 

(pX.Cf p = ni^A 1 ! Cg x/£] C5} 

W = UI^A 1 ! 5C Cj [x/£] } 

In the last two cases * s interpreted as an operator from subsets £ of A 1 to subsets 

of A 1 . By the syntactic restriction enforced on variables, such an operator is guaranteed 
to be monotonic wrt C. Notice that free variables appearing in a concept are interpreted 
similarly to atomic concepts. 

A concept C is satisfiable, if there exists an interpretation X and a valuation p on X 
such that C x ^ 0, otherwise C is unsatisfiable. A concept C\ is subsumed by a concept C 2 , 
written as C\ C C 2 , if for every interpretation X and every valuation p on X, (Ci)J C (C 2 )J. 

A /liACCQ TBox is a finite (possibly empty) set of inclusion assertions Ci C C 2 where 
Ci and C 2 are closed concepts of pACCQ. As before, we use equivalence assertions of the 
form C\ = C 2 as an abbreviation for {C\ C C 2 , C 2 C Ci}. 

An interpretation I satisfies an inclusion assertion C\ C C 2 , if (Ci) x C (C 2 )^, where /? 
is any valuation on I (being Ci and C 2 closed, and hence independent from valuations). 
X is a model of a TBox /C, if X satisfies all inclusion assertions in K,. We say that a TBox 
K, is satisfiable, if it has a model. Observe that inclusion assertions in K, are interpreted 
according to the descriptive semantics. 

We say that a TBox K, logically implies an inclusion assertion C\ C C 2 , written K, \= 
C\ C C 2 , if for every model X of K, and every valuation p on X, {C\) x C (C 2 )^. 



4.1 Properties of the Fixpoint Constructs 

In the following, we use the notation C(X) to indicate that the variable X occurs free in 
the concept C (other variables could occur free in C as well), and the notation C(D), where 
D is a concept, as a shorthand for C(X)[X/D] (i.e., the concept obtained substituting all 
free occurrences of A in C(A) by the concept D). 

Let us comment briefly on some simple properties of the logic. First, the concept 
aX.C(X) is equivalent to the concept aY.C(Y), as long as Y is free for A in C(A). Second, 
the extension function x assign to a closed concept a value which is independent of the actual 
valuation p. Hence aX.C, where A does not occur in C, is equivalent to C. Third, since 
crA.C(A) is a fixpoint we have that C(aX.C(X)) is equivalent to crA.C(A). Furthermore, 
we have that the concept |uA.C(A) is always subsumed by the concept uX.C(X). 
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The next property is more substantial. Consider the class of a single source finite 
directed acyclic graphs (DAGs) defined inductively as follows 7 : 

• The EMPTY-DAG is a DAG (base step). 

• A NODE that has connections and all connections are DAGs, is a DAG (inductive 
step). 

• Nothing else is a DAG. 

Consider now a pACCQ TBox K, containing the two equivalence assertions: 
dag-of student = pX . emptydag U (student n 3arc.T n Varc.A) 

dag -of ..person = pX . emptydag U (person n 3arc.~T n Varc.A) 

which define the concepts dag-of student and dag-of -person as the classes of DAGs whose 
nodes are students and persons respectively. Assuming that students are persons, we want 
to be able to infer that DAGs of students are DAGs of persons as well. That is, we want: 

K, \= student C person implies K, \= dag -of student C dag -of -person. 

It turns out that for pACCQ such a property holds. To prove this, we introduce the 
following two theorems. 

Theorem 2 Let K be a pACCQ TBox, and C and D two pACCQ concepts in which a 
variable X occurs free positively. Then: 

JC |= C C D implies JC \= aX.C C oX.D. 

Proof We proceed by contradiction 8 . Assume that Cp C D T p holds for all models I of K 
and all valuations p on T. And suppose that there exists a model T of K, and a valuation p 
on 1 such that (oX.C) x p % (oX.D) x p . 

First we prove the result for o = p. Let s be an individual in (pX.C) 1 but not in 
(pX.D) x . Now, we have: 

s G (pX.Cfp iff VKA 1 . (Cp [x /£] C S implies s e £) (2) 

s £ (pX.Df p iff 3£' C A 1 . (D X p [x/£l] C £' and s <££'). (3) 
For the set £' in (3), the following expression holds: 

C p[X/£>] ^ D p[X/£'] ^ £> 

7. We assume that a leaf of a DAG is a NODE with all arcs leading to a special DAG called EMPTY-DAG. 
As an alternative, one can assume that a leaf of a DAG is a NODE having no connection at all. In the 
latter case, the definition of dag would simplify to dag =def node l~l Varc.dag (in which the general form 
of inductive definitions - i.e., base case and inductive case - is less apparent). 

8. For uniformity, we do not distinguish if X occurs free or not. Obviously if X does not occur free, the 
result is trivial. 
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hence by (2) we have s E £' and by (3) we have s $ £', which is impossible. 

The proof for a = v is similar. Let s be an individual in {yX.C) T p but not in {yX.D) T p . 
Now, we have: 

s E [yX.Cfp iff 3£" C A 1 . (5" C Cj [x/£ „] and s E 5") (4) 

s £ {vX.Dfp iff VKA J .(K D j[x/£] implies s g 5). (5) 
For the set £" in (4), the following expression holds: 

£" Q Cp[x/£"] ^ D p[x/£"] 
hence by (4) we have s E £" and by (5) we have s which is impossible. □ 

Above we have defined what it means for a variable X to occur positively in a concept C. 
Similarly we say that a variable X occurs negatively in a concept C, if every free occurrence 
of X is in the scope of an odd number of negations, considering concepts C in (< nR.C) 
in the scope of one negation. 



Theorem 3 Let K be a ixACCQ TBox, and D(X) a ixACCQ concept with the variable X 
as a free variable. Then, for any fiACCQ concepts C\ and C: 

^ , „ ,_ „ . ,. I K, \= D(C\) C DiCo) if X occurs positively in D(X) 
K. \= d C C 2 implies { ^ 1 " 2 .{ ,• , • n J 

I /C |= Dybi) E -^(^l) y A occurs negatively in L>(X) 



Proof We prove the result by induction on the formation of D(X). 
Base case. If D(X) = X, the result holds trivially. 

Inductive cases. If D(X) has the form -<D'(X) \ (> nR.C) , then X occurs positively 
(negatively) in D'(X) and negatively (positively) in D(X). By induction hypothesis /C |= 
D'(Ci) C D'(Cj) (where i, j E {1, 2} and i 7^ j) and hence by the semantics of the constructs 
K\=D(Cj)QD(Ci). 

If D{X) has the form D[(X) n ^(-X") I ^i(^) U ^(-X") I VR.D'(X) | (> nR.D'(X)), 
then X occurs positively (negatively) both in D'(X) and in -D(X). By induction hypothesis 
K, |= D'(Ci) C D'(Cj) and hence by the semantics of the constructs /C |= D(Ci) C D(Cj). 

It remains to prove the result for -D(X) = cry.D'(X) (y 7^ X). In this case, by the 
syntactic restriction enforced, Y occurs positively in D'(X) and hence by Theorem 2 we have 
K, \= D'(d) C D'(Cj) implies K, \= aY.D'{Ci) C <rY.D'(Cj), thus by induction hypothesis 
we are done. □ 



Going back to our example, we can, in fact, infer that DAGs of students are also DAGs of 
persons. Indeed, by applying Theorem 3 and then Theorem 2, we have that K, \= student C 
person implies K, \= fj,X.emptydagU(studentr~\Barc.T\~Narc.X) C /j,X.emptydagU(person\~\ 
3arc.T fl \/arc.X). 
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4.2 Internalizing Assertions 

We now show that logical implication in pACCQ TBoxes (thus also satisfiability of pACCQ 
TBoxes) is reducible to unsatisfiability of a single pACCQ concept. To prove this result, we 
introduce the notions of generated sub-interpretation and sub- valuation 9 . 

Let X = (A 1 , x ) be an interpretation, p a valuation on X, and s £ A x an individual. 
We define the interpretation X s = (A x \ - x °), and the valuation p s on I s , as follows: 

. A x ° = {s 1 £ A x | (s, s') £ U . . . U 

• For each atomic role we have = Rf n (A IS x A 1 *). 

• For each atomic concept A, we have = A 1 n A IS . 

• For each variable X, we have p s {X) = p{X) n A IS . 

We call X s i/ie sub-interpretation of I generated by s, and p s the sub-valuation of p generated 
by s. 

For generated sub-interpretations and sub- valuations we can state the following lemma. 

Lemma 4 Let C be a pACCQ concept. Then for any interpretation X, any valuation p on 
X, and any individual s 6 A 1 , we have: 

Vt e A xs . t e iff *eCj\ 

Proof Without loss of generality, we consider concepts formed according to the following 
simplified abstract syntax: C ::= A \ _L | | C\ n C 2 | 3i?.C | (> ni?.C) | /liXC | X. 

We prove the result by induction on the number of nested fixpoint constructs. Base 
case. If in C there are no fixpoint constructs, the thesis can be proven by induction on the 
formation of C. 

Inductive case. We assume that the thesis holds for concepts C with k nested fixpoint 
constructs, and we prove it for concepts pX.C with k + 1. We recall that, by the Tarski- 
Knaster Theorem on fixpoints (Tarski, 1955), t £ (pX.C) x iff there exists an ordinal a such 
that t £ (p a X.C) x , where (p a X.C) x is defined by transfinite induction as: 

. (p X.C) x = 

• (p a+1 x.c) x = c x {xj{jXaXC) ^ 

• (p\X.C) x p = \J a<x (p a X.C) x , if A is a limit ordinal. 

Hence we proceed by transfinite induction on ordinals a. 

Base case of the transfinite induction. pqX.C is defined as _L, thus trivially we have 

te(p x.c) x iff te(p x.c) x :. 

Successor case of the transfinite induction. We want to show that t £ {p a+ \X.C) x iff t £ 
(p a+ \X.C) X s , which reduces to: 

1 G c p[x/(^x.c)j] iff 1 G c ^[x/i^x.c)^y ( 6 ) 

9. Together these notions play the same role as that of generated sub-model in modal logics. 
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To prove this, we start by showing that: 

f E C p°[X/(» a X.C)Z s 3 ] lff 1 G C fp[X/(n a X.C)*])>- ( 7 ) 

Notice that the two valuations above may differ only on the value of X. If it holds that: 

* € X f[x/(ji a x.c)? s ] iff * G XfpW^X-C)*])" ( 8 ) 

then by straightforward induction on the formation of C we have that (7) holds as well. 
Let us prove (8). We can write it as: 

1 6 p s [xi^ a x.c) x ;](x) iff t e ( P [x/(f, a x.cf p ]y(x), 

and since t 6 A 1 ' , this reduces to 

t e {n a x.cf p : iff t e (p a x.cf p . 

which holds by transfinite inductive hypothesis. 

Now, since C contains k fixpoint constructs, by inductive hypothesis on k, we have: 

f G C p[x/(^x.c)j] lff * G c fp[x/(p, a x.c)Z]y 

Hence, considering (6) and (7), it follows that indeed t 6 (/j, a+ iX.C) x iff t 6 (p a+ iX.C)^l . 

Limit case of the transfinite induction. Let A be a limit ordinal, then t € {ix\X.C) x iff 
there exists an ordinal a < A such that t 6 (/j, a X.C) x . By transfinite induction hypothesis, 
it holds that: £ 6 (/j, a X.C) x iff t 6 (/ji a X.C) X s , and thus: 

te(/x A XC)J iff te(/i A XC)j;. 

This completes the transfinite induction. So for all ordinals a it holds that: 

teivaX.Cfp iff t£(/i Q XC)j:. 

The induction on the nesting of fixpoint constructs is completed as well, hence we have 
proven the lemma. □ 

Now we are ready to state the result mentioned above. 

Theorem 5 Let K = {C t C D 1 ,...,C q C D g } be a pACCQ TBox, and C and D two 
fiACCQ concepts. Then K, |= C C D if and only if the fiACCQ concept: 

vX.(vr 1 .x n . . . n VR m .x nc K )ncn^D (9) 

is unsatisfiable, where R\, . . . , R m are all the atomic roles appearing in K,, and Cjc = ( _, Ci U 

Di)n...n(-.CgUDg). 
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Proof If part. By contradiction. Assume that (9) is not satisfiable, and suppose that 
JC y= C C -D, i.e., there exists an interpretation X, and a valuation p on X, such that X 
is a model of K and Cj <2 D x . It follows that, there exists an individual s G A 1 such 
that s 6 C x and s G (-<D) X . On the other hand, the fact that X is a model of K, implies 
that {Ctcfp = A x , and thus that (vX.(VR 1 .X U...U Vi? m .A n C K )) x p = A 1 . So we have 
s G (vX.{\/Ri.X n . . . n VR m -X n C/c) n C n -.-D)^, i.e., (9) is satisfiable, contradicting the 
hypotheses. 

Only If part. Again we proceed by contradiction. Assume K, |= C C D. And suppose 
that (9) is satisfiable, i.e., there exists an interpretation X, a valuation /? on X, and an 
individual s G A J , such that s G {vX.{\/Ri.X U...U Vi? TO .A n C K ) U C U ^B) x p . 

Now consider the sub-interpretation X s = (A xs , • J« ) and the sub- valuation p s on X s 
generated by s. On the one hand, we clearly have that {Cjc) x 1 = A x \ hence X s is a model 
of/C. On the other hand by Lemma 4 s G (i^X(Vi?i.Xn. . .nVfl m .XnCj C )nCn-..D)^', so it 
follows that X s and /? s do not satisfy the subsumption C Q D, contradicting the hypotheses. 
□ 

This result states that satisfiability of pACCQ concepts and logical implication in 
pACCQ TBoxes (and thus of satisfiability of pACCQ TBoxes) are not distinct reasoning 
tasks. Hence in the following we will limit our attention to concept satisfiability without 
loss of generality. 

5. Reasoning with Fixpoints 

In this section we concentrate on developing reasoning methods to check for satisfiability 
concepts involving fixpoints. In particular, we exhibit a correspondence between pACCQ 
and a well-known logic of programs, called modal mu-calculus (Kozen, 1983; Kozen & 
Parikh, 1983; Streett & Emerson, 1984, 1989), that has been recently investigated for 
expressing temporal properties of reactive and parallel processes (Stirling, 1992; Larsen, 
1990; Cleaveland, 1990; Winsket, 1989; Dam, 1992). 

To get a better insight on the correspondence between the two logics, we first study 
the sublanguage pACC obtained from pACCQ leaving out qualified number restrictions 10 . 
Then, we study the full logic pACCQ. 

5.1 Reasoning in pACC 

Let us introduce modal mu-calculus formally. Formulae of modal mu-calculus 

are formed inductively from atomic formulae A, . . . and variables A, . . . according to the 
following abstract syntax: 

$,*::= A | T | _L | | $ A * | $ V * | (a)$ | [a]$ | pX.<f> | vX.$ \ X 

where a is the generic element of a set of labels C, and every bounded occurrence of every 
variable X must be in the scope of an even number of negation signs. 

10. Observe that, in Theorem 5 qualified number restrictions play no role. Hence exactly the same reduction 
from logical implication to unsatisfiability holds for fiACC as well. This allows us to restrict our attention 
to satisfiability only. 
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The semantics of modal mu-calculus is based on the notions of (Kripke) structure and 
valuation. A Kripke structure Ai is a triple (5, {Hi \ i G £}, V), where S is a set of states, 
each IZi is a binary relation on 5, and V is a mapping from atomic formulae to subsets of 
S. A valuation p on Ai is a mapping from variables to subsets of S. To a Kripke structure 
Ai and a valuation p on Ai, it is associated an extension function defined inductively 
as follows: 

X? = p(X)CS 

= V(A) C S 
= S 

±P = 

H&)^ = s-<s>y 

(<£ v ^) M = $ M u 

((a)$)f = {s£S\3s'. (s,s') Glands' G 
([a]$)^ = {.s G 5 | Vs'. (s, a') G TZ a implies a' G 

= n{^c5| ^a} 

A formula <3? is satisfiable if there exists a Kripke structure Ai and a valuation p on A4 such 
that / 0. 

The following theorem is the basis for the correspondence between pACC and the modal 
mu-calculus. 



Theorem 6 There exists a one-to-one linear-time function q mapping concepts of pACC 
to formulae of modal mu-calculus such that: for any pACC concept C , C is satisfiable if 
and only if q{C) is satisfiable. 

Proof We can define q in the following way: q(A) = A (atomic concepts are mapped 
to atomic formulae), q(X) = X, q(T) = T, q{±) = _L, q{^C) = -^q(C), q{3R.C) = 
(R)q(C) (atomic roles are mapped to labels), q^iR.C) = [R]q(C), q(pX.C) = pX.q(C), 
and q{uX.C) = vX.q{C). 

An interpretation X = (A x , ■ x ) is equivalent to a Kripke structure Ai = (S, {IZi \ i G 
£}, V) such that: S = A 1 ; C is equal to the set of names of the atomic roles interpreted 
in I; TZr = R x for each atomic role R; and V(A) = A 1 for each atomic concept A. In 
addition, a valuation p on 1 is equivalent to a valuation p' on X. Now both the extension 
function associated with 1 and p and the extension function associated with Ai and p' 
map, respectively, any concept C and the corresponding formula q(C) to the same subset 
of A 1 = S. Hence the thesis follows. □ 



It follows that we may transfer both decidability and complexity results for the modal 
mu-calculus (Kozen & Parikh, 1983; Emerson & Jutla, 1988; Safra, 1988) to pACC. Thus, 
we can immediately state what is the complexity of reasoning with pACC concepts and 
pACC TBoxes. 

Theorem 7 Satisfiability of pACC concepts, satisfiability of pACC TBoxes, and logical 
implication in pACC TBoxes are EXP TIME- complete problems. 
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Proof The satisfiability problem for modal mu-calculus is EXPTIME-complete (Emerson 
Sz Jutla, 1988), hence by Theorem 6 and by Theorem 5 the thesis follows. □ 

5.2 Reasoning in fiACCQ 

Next we exhibit a mapping from uACCQ concepts to formulae of variant of modal mu- 
calculus, called deterministic modal mu-calculus, which has the same syntax as the modal 
mu-calculus, but is interpreted over deterministic Kripke structures, that is Kripke struc- 
tures M. = (S,{lZi | i £ £},V) in which the relations IZi are partial functions (Streett h 
Emerson, 1984). 

Let us ignore for a moment the qualified number restriction constructs. Formulae of 
\xACCQ without qualified number restrictions are, in fact, formulae of the modal mu- 
calculus, as shown in the previous section. By using a well-known technique developed 
for propositional dynamic logic (Parikh, 1981), (nondeterministic) modal mu-calculus for- 
mulae can be reduced to deterministic modal mu-calculus formulae (Streett & Emerson, 
1984), as shown below. 

We use the following notations for usual operations on binary relations: -o- for chaining, 
•* for reflexive transitive closure, - + for transitive closure, and -~ for converse. We also use 
the following abbreviations: 



The reduction is as follows: in a formula $, we recursively replace each subformulae of 
the form [R]4> by [R][{R new )*]4> and each subformulae of the form (R)4> by (R)({R new )*)4>, 
where R new is a new symbol and both R and R new m the resulting formula are interpreted 
as partial functions. Let us call the resulting formula we have that $ is satisfiable if and 
only if $' is satisfiable. 

We briefly sketch the reasoning behind the proof of this statement. The if direction 
is easy: it suffices to observe that if M D = {S D ,{Rf \ i £ C D },V D ) is a model of 
then can transform it into a model A4 = (S,{lZi \ i £ £},V) of <3? by defining S = S D , 
£ = CP - new, n R = TZ^o {K% ew )*, and V = V D . The only if direction is as follows. We 
recall that both nondeterministic and deterministic modal mu-calculus have the tree model 
property (Streett & Emerson, 1989, 1984): if a formula has a model it has a tree model, 
i.e., a model having the form of a tree 11 . So without loss of generality we can restrict our 
attention to tree models only. Now there is a one-to-one transformation from tree models 



M T = (S T ,{Kf | i € C T },V T ) of $ to (tree) models M B = {S B ,{Tlf \ i € C B },V B ) of 



Indeed, we put S B = S T , V B = V T , C B = C7 ', and given a state x £ S T having as 



11. Given a model of $ we get a tree model simply by "unfolding" the original one. 



[R*]<f> 

[R + ]4> 
(R*)4> 
(R+)4> 



for 
for 
for 
for 



uX.((f)A[R]X) 
[R][R*}4> 

ux.(4>y (R)x) 

(R)(R*)ct>. 
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7\^-successors z\, . . . , zi, 12 we put (x, z\) 6 7£^, and (zj, 6 T^Jj , for i = 1, — 1. 
In this way we have (a;, 2$) 6 if and only if (x, zi) 6 TZjj o (7v^ nem )*. 13 

We remark that M T is required to be a tree because once we get M B we need to recover 
the "original" 7\!/^-predecessor a; of a state Zi, namely we need (JIr ° (Rr )*)~ to be 
a partial function, otherwise, given a state Zi, we would not know which of the various 
(TIr ° (Hr ew )*) "-successors is its original 7\!/^-predecessor x, and therefore we would not 
be able to reconstruct M T from M B . 

By interpreting R and R new as partial functions, it easy to express qualified number 
restrictions as constraints on the chain of (R o i?* ew )-successors of a state. For example: 
(< 3R.cfi) can be expressed by 

[R}[(Rne W )*}h4> V [(Rne W ) + ]h4> V [(Rne W ) + ]h4> V [(R new ) + h4>))) 

and can be read as "everywhere along the chain R o (R new )* there are at most three states 
where cj) holds", which corresponds exactly to the intended meaning. Similarly (> 3i?.0) 
can be expressed by 

(R)((R new y)(4> A ((Rn ew ) + )(4> A ((i? new ) + }0)) 

and can be read as "somewhere along the chain R o (R new )* there are at least three states 
where cj) holds" , which again corresponds exactly to the intended meaning. 
The above discussion allows us to state the following result. 

Theorem 8 There exists a polynomial function t mapping concepts of fiACCQ to formulae 
of deterministic modal mu-calculus such that: for any \xACCQ concept C , C is satisfiable 
if and only if u(C) is satisfiable. 

Proof The function t is defined inductively as follows: 



u(A) 


= A 


u(X) 


= X 


u(d n Ci) 


= u(C\)Au(C 2 ) 


u{d u c 2 ) 


= u(Ci)Vu(C 2 ) 


u{^C) 


= -.u(C) 


u(nX.C) 


= nX.u(C) 


u{vX.C) 


= uX.u(C) 


u{3R.C) 


= (R)((R new )*)u(C) 


u{VR.C) 


= [R][(R new y]u(C) 



where R new is a new role. Finally, (< nR.C) and (> nR.C) are mapped to the following 
formulae: 

12. We implicitly assume that M T is a finite branching tree model. This can be done without loss of 
generality since modal mu-calculus has the finite model property, and hence unfolding a finite model we 
get a finite branching tree model. Note however that it would suffice to assume M T to be a countable 
branching tree model. 

13. Note that this construction is similar to the one often used in programming to reduce n-ary trees to 
binary trees by coding children of a node as the combination of one child and its siblings. 
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«((< nR.C)) =[R][{R new y]{^u{C) V [{R new )+]{^u{C)V 

[(R new ) + }(. . . (MC) V [(R new ) + hu(C)) ■ ■ •))) 

where the number of nested formulae of the form -iit(C) V [(R new ) + ]Q is n, and 

«((> nR.C)) =( J R)((i? new )*)(«(C) A ((R new )+)(u(C)A 

{(Rne W ) + )(. ■ ■ (U(C) A ((R new ) + )U(C)) . . .))) 

where the number of nested formulae of the form u(C) A ((R new ) + )Q is n — 1. 

u(C) is clearly polynomial in the size of C (under the usual assumption that numbers in 
C coded in unary). Moreover, following the technique in (Parikh, 1981; Streett & Emerson, 
1984) that as been exposed above, it is easy to verify, by induction on the formation of the 
concept C, that the mapping t preserves satisfiability. □ 

It follows that we may transfer both decidability and complexity results for the de- 
terministic modal mu-calculus (Streett & Emerson, 1984; Emerson & Jutla, 1988; Safra, 
1988) to ixACCQ. Thus, we can immediately state what is the complexity of reasoning with 
\xALCQ, concepts and jiACCQ TBoxes. 

Theorem 9 Satisfiability of fiACCQ concepts, satisfiability of fiACCQ TBoxes, and logical 
implication in \xACCQ TBoxes are EXP TIME- complete problems. 

Proof Satisfiability in deterministic modal mu-calculus is an EXPTIME-complete problem 
(Streett & Emerson, 1984; Emerson & Jutla, 1988; Safra, 1988). Hence by Theorem 8 and 
Theorem 5 the thesis follows. □ 

6. Discussion and Conclusion 

The work presented in this paper stems out from (De Giacomo, 1993), where the basic ideas 
of introducing explicit fixpoint was first presented, and (De Giacomo & Lenzerini, 1994b), 
where such idea was further elaborated and fiACCQ was first introduced. 

One of the main contributions of this work has been to devise a tight correspondence 
between description logics with fixpoints and modal mu-calculus. In this respect we remark 
that, while fiACC corresponds directly to modal mu-calculus, the full fiACCQ corresponds 
to a variant of modal mu-calculus whose decidability and complexity had not been studied. 
More precisely, a notion essentially equivalent to that of qualified number restrictions has 
independently emerged in modal logics, namely that of graded modalities (Van der Hoek 
& de Rijke, 1995; Van der Hoek, 1992; Fattorosi-Barnaba & De Caro, 1985; Fine, 1972). 
However the combination of fixpoints and graded modalities had not been investigated 
before in the setting of modal logics. Given the tight correspondence between \xALC and 
modal mu-calculus, fiACCQ can be considered as modal mu-calculus augmented with graded 
modalities. Hence the results in this paper apply to such a logic as well. 

The research reported in this paper bears several similarities with the one on the cor- 
respondence between description logics and propositional dynamic logics (Baader, 1991; 
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Schild, 1991; De Giacomo & Lenzerini, 1994a, 1995; De Giacomo, 1995). In fact what char- 
acterize description logics based on propositional dynamic logics are the role constructs for 
chaining, choice, test, and above all reflexive transitive closure of roles, which is a limited 
form of fixpoint. Such role constructs can be easily expressed by using the explicit fixpoints 
introduced here. It suffice to resort to the following equivalences: 

3Ri oR 2 .C = 3R 1 3R 2 . C 
BRi U R 2 .C = BRi.C U BR 2 .C 
3R*.C = nX.{CU3R.X) 
3id{D).C = CHD. 

Note that MR*.C = vX.[C n VR.X). In (Calvanese, De Giacomo, & Lenzerini, 1995) 
a further implicit form of fixpoint has been advocated, the so called well-founded role 
construct wf(R). By explicit fixpoints, wf(R) is expressed simply as ixX.(\JR.X). 

Our proposal of allowing for fixpoint constructs explicitly in the formalism is shared 
by the study independently carried out by Schild in (Schild, 1994) 14 . The main goal of 
that work is to study both the expressive power and the computational complexity of 
subsumption and satisfiability for TBoxes expressed in ACC (no fixpoint constructs), that 
allow for mutually recursive definitions. To this end, a description logic is defined that 
corresponds to a variant of the modal mu-calculus in which mutual fixpoints are allowed 
but some restrictions on nested fixpoints are enforced (Vardi & Wolper, 1984). It is well 
known that mutual fixpoints can be re-expressed by means of nested ones (see, for example, 
Park, 1976; de Bakker, 1980). As a consequence of this observation it follows that the logic- 
introduced in this paper, is more expressive than the one analyzed in (Schild, 1994) since, 
on the one hand, it allows nesting of fixpoints without any restriction, on the other hand 
it makes it possible to state sophisticated forms of cardinality constraints on role fillers by 
means of qualified number restrictions. 

The present work can be extended along several directions. We conclude by outlining 
two of them. 

We already noticed that fixpoint constructs allow for representing not only abstract 
classes, but also several data structures extensively used in software development. We 
believe that this characteristic is an important step towards a satisfactory integration of 
description logics with both traditional and declarative programming systems. Indeed the 
description logic proposed in this paper provides powerful mechanisms for data structure 
modeling. In particular, the properties stated in Section 4.1 can be the base to formulate a 
notion of parametric concept 15 . For instance, the expression (named dag-of[Z]) 

\iX . emptydag U (Z l~l 3arc.T l~l "iarc.X) 

where Z is a formal parameter, denotes the class of DAGs whose nodes are left unspecified. 
This class can be used in several ways in the TBox. For example, it can be instantiated 
by binding the formal parameter to actual parameters, thus getting, say, dag-of [student] , 
dag jo j [person], etc., which are concepts inheriting the properties of dagjoj\Z]. 

14. In (Schild, 1994) number restrictions are not considered. 

15. Note that parametric concepts can be introduced also in simpler logics which do not include fixpoint 
constructs. 
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Although \xACCQ is a powerful logic, it lacks the construct for inverse roles which is 
needed for example to correctly capture the notions of (finite) TREE, BINARY-TREE, etc. 
Indeed, to define the concept of TREE (an EMPTY-TREE is a TREE; a NODE that has 
at most one parent, some children, and all children are TREEs, is a TREE; nothing else is a 
TREE) we can write tree = \iX . empty JtreeU {node l~l (< 1 child~ .T) n 3child.T n Vchild.X 
where child~ denotes the inverse of child. Notice that the introduction of inverse roles 
does not pose any difficulty from the semantical point of view; however, its impact on 
the reasoning method needs to be investigated. More generally, a wide variety of concept 
constructs can be studied in conjunction with fixpoints. The research on description logics 
related to propositional dynamic logics in (De Giacomo & Lenzerini, 1994a, 1995; Calvanese 
et al., 1995; De Giacomo, 1995) may give us hints on how to proceed along this direction. 
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